3574 results found.
Written
Corpus,
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
<Not Specified>
Size:
220 entries
Production Status:
Existing-used
Use:
Evaluation/Validation
- Paper title: Neural Models of Selectional Preferences for Implicit Semantic Role Labeling
- Paper track: Written
- Paper status: Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Minh Le | VU University Amsterdam | NL |
| Author 2 | Antske Fokkens | VU Amsterdam | NL |
| Main Contact | Minh Le | VU University Amsterdam | None |
Documentation:
Josef Ruppenhofer, Caroline Sporleder, Roser Morante, Collin Baker, and Martha Palmer. 2010. SemEval-2010 Task 10: Linking Events and Their Participants in Discourse.
Written
Corpus,
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
ELRA END-USER AGREEMENT
Size:
10 MByte
Production Status:
Newly created-finished
Use:
Machine Learning
- Paper title: Annotating High-Level Structures of Short Stories and Personal Anecdotes
- Paper track: Written
- Paper status: Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Boyang Li | Liulishuo Silicon Valley AI Lab | US |
| Author 2 | Beth Cardier | Sirius-Beta | US |
| Author 3 | Tong Wang | University of Massachusetts Boston | US |
| Author 4 | Florian Metze | Carnegie Mellon University | US |
| Main Contact | Boyang Li | Liulishuo Silicon Valley AI Lab | None |
Documentation:
The data are largely self-explanatory. A short description will be available.
Written
Corpus,
Language Type:
Trilingual
Languages:
English Serbian French
Availability:
From Owner
License:
CreativeCommons
Size:
2041113 tokens
Production Status:
Newly created-in progress
Use:
Cross-linguistic comparison, machine translation
- Paper title: TALC-sef A Manually-Revised POS-TAgged Literary Corpus in Serbian, English and French
- Paper track: Written
- Paper status: Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Antonio Balvet | UMR STL 8163, Université Lille 3 | FR |
| Author 2 | Dejan Stosic | UMR CLLE-ERSS 5263, Université Toulouse 2/CNRS | FR |
| Author 3 | Aleksandra MILETIC | UMR STL 8163, Université Lille 3 | FR |
| Main Contact | Antonio Balvet | UMR STL 8163, Université Lille 3 | None |
Documentation:
A. Miletic's master's thesis, in French
Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
The annotations are licensed under the Creative Commons CC-BY 4.0 license; the original content from ClueWeb12 keeps its original license; the annotation tool is licensed under the GNU General Public License v3.0.
Size:
28 MByte
Production Status:
Newly created-finished
Use:
Corpus Creation/Annotation
- Paper title: Beyond Generic Summarization: A Multi-faceted Hierarchical Summarization Corpus of Large Heterogeneous Data
- Paper track: Evaluation
- Paper status: Accept Poster+DemoSuggested
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Christopher Tauchmann | Technische Universität Darmstadt | DE |
| Author 2 | Thomas Arnold | Technische Universität Darmstadt | DE |
| Author 3 | Andreas Hanselowski | UKP Lab, Technische Universität Darmstadt | DE |
| Author 4 | Christian M. Meyer | UKP Lab, Technische Universität Darmstadt | DE |
| Author 5 | Margot Mieskes | University of Applied Sciences, Darmstadt | DE |
| Main Contact | Christopher Tauchmann | Technische Universität Darmstadt | None |
Documentation:
yes, English
Written
Corpus,
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
Not Specified
Size:
695059 sentences
Production Status:
Existing-used
Use:
Information Extraction, Information Retrieval
- Paper title: Cooperative Denoising for Distantly Supervised Relation Extraction
- Paper track: NLP engineering experiment paper
- Paper status: Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Kai Lei | Peking University Shenzhen Graduate School | N/A |
| Author 2 | Daoyuan Chen | Peking University Shenzhen Graduate School | CN |
| Author 3 | Yaliang Li | Tencent Medical AI Lab | N/A |
| Author 4 | Nan Du | Tencent Medical AI Lab | N/A |
| Author 5 | Min Yang | SIAT, Chinese Academy of Sciences | N/A |
| Author 6 | Wei Fan | Tencent Medical AI Lab | N/A |
| Author 7 | Ying Shen | Peking University Shenzhen Graduate School | N/A |
| Main Contact | Daoyuan Chen | Peking University Shenzhen Graduate School | None |
Documentation:
Conference paper "Modeling relations and their mentions without labeled text"
Written
Corpus,
Language Type:
Multilingual
Languages:
English
Availability:
From Owner
License:
<Not Specified>
Size:
approx. 33,000 words
Production Status:
Newly created-finished
Use:
Information Extraction, Information Retrieval
- Paper title: TagNText: A parallel corpus for the induction of resource-specific non-taxonomical relations from tagged images
- Paper track: Written
- Paper status: Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Theodosia Togia | University of Cambridge | GB |
| Author 2 | Ann Copestake | University of Cambridge | GB |
| Main Contact | Theodosia Togia | University of Cambridge | None |
Documentation:
http://www.cl.cam.ac.uk/~tt309/README
Written
Corpus,
Language Type:
Multilingual
Languages:
English
Availability:
From Data Center(s)
License:
LDC
Size:
326 <Not Specified>
Production Status:
Existing-used
Use:
Discourse
- Paper title: A corpus of general and specific sentences from news
- Paper track: Written
- Paper status: Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Annie Louis | University of Pennsylvania | None |
| Author 2 | Ani Nenkova | University of Pennsylvania | None |
| Main Contact | Annie Louis | University of Pennsylvania | US |
Documentation:
yes, English, http://www.ldc.upenn.edu/Catalog/CatalogEntry.jsp?catalogId=LDC2008T05
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
Creative Commons
Size:
3938 articles Other
Production Status:
Newly created-in progress
Use:
Evaluation/Validation
- Paper title: Annotating Spin in Biomedical Scientific Publications : the case of Random Controlled Trials (RCTs)
- Paper track: Evaluation
- Paper status: Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Anna Koroleva | LIMSI-CNRS | FR |
| Author 2 | Patrick Paroubek | LIMSI-CNRS | FR |
| Main Contact | Patrick Paroubek | LIMSI-CNRS | None |
Documentation:
annotation manual in English and a Document Type Definition
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
<Not Specified>
Size:
<Not Specified>
Production Status:
Existing-used
Use:
Document Classification, Text categorisation
- Paper title: Sublanguage Corpus Analysis Toolkit: A tool for assessing the representativeness and sublanguage characteristics of corpora
- Paper track: Written
- Paper status: Accept Poster
| Author Number | Name | Affiliation | Country | Affiliation 2 | Country 2 |
|---|---|---|---|---|---|
| Author 1 | Irina Temnikova | Qatar Computing Research Institute | BG | Qatar Computing Research Institute, HBKU | QA |
| Author 2 | William A. Baumgartner Jr. | University of Colorado School of Medicine | US | U. Colorado School of Medicine | US |
| Author 3 | Negacy D. Hailu | University of Colorado School of Medicine | US | | |
| Author 4 | Ivelina Nikolova | Bulgarian Academy of Sciences | BG | | |
| Author 5 | Tony McEnery | Lancaster University | GB | | |
| Author 6 | Adam Kilgarriff | Lexical Computing Ltd. | GB | | |
| Author 7 | Galia Angelova | Bulgarian Academy of Sciences | BG | | |
| Author 8 | K. Bretonnel Cohen | University of Colorado School of Medicine | US | | |
| Main Contact | Irina Temnikova | Qatar Computing Research Institute, HBKU | None | Sofia University | None |
Documentation:
<Not Specified>
Language Type:
Multilingual
Languages:
English Mandarin Chinese
Availability:
Freely Available
License:
GPL
Size:
20000000 sentences
Production Status:
Newly created-finished
Use:
Machine Translation, SpeechToSpeech Translation
- Paper title: Dual Subtitles as Parallel Corpora
- Paper track: Written
- Paper status: Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Shikun Zhang | CMU | US |
| Author 2 | Wang Ling | CMU-LTI & INESC-ID | US |
| Author 3 | Chris Dyer | Carnegie Mellon University | GB |
| Main Contact | Wang Ling | Google DeepMind | None |
Documentation:
<Not Specified>




